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Let A = -^i=(X T X — pi n ) where X is a p x n matrix, consisting of independent and identically 
distributed (i.i.d.) real random variables Xij with mean zero and variance one. When p/n —>• oo, 
under fourth moment conditions a central limit theorem (CLT) for linear spectral statistics 
(LSS) of A defined by the eigenvalues is established. We also explore its applications in testing 
whether a population covariance matrix is an identity matrix. 
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1. Introduction 

The last few decades have seen explosive growth in data analysis, due to the rapid 
development of modern information technology. We are now in a setting where many 
very important data analysis problems are high-dimensional. In many scientific areas, 
the data dimension can even be much larger than the sample size. For example, in 
micro-array expression, the number of genes can be tens of thousands or hundreds of 
thousands while there are only hundreds of samples. Such kind of data also arises in 
genetic, proteomic, functional magnetic resonance imaging studies and so on (see Chen 
et al. [11], Donoho [13], Fan and Fan [14]). 

The main purpose of this paper is to establish a central limit theorem (CLT) of linear 
functionals of eigenvalues of the sample covariance matrix when the dimension p is much 
larger than the sample size n. Consider the sample covariance matrix S = ^-XX T , where 
X = ( Xij) pxn and , i = 1,... ,p, j = 1 ,...,n are i.i.d. real random variables with mean 
zero and variance one. As we know, linear functionals of eigenvalues of S are closely 
related to its empirical spectral distribution (ESD) function F s (x). Here for any n x n 
Hermitian matrix M with real eigenvalues Ai,..., \ n , the empirical spectral distribution 
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of M is defined by 

1 ” 

3 = 1 

where /(•) is the indicator function. However, it is inappropriate to investigate F s (x) 
when p/n —> oo since S has (p — n) zero eigenvalues and hence F s (x ) converges to a 
degenerate distribution with probability one. Note that the eigenvalues of S are the 
same as those of —X T X except (p — n) zero eigenvalues. Thus, instead, we turn to the 
eigenvalues of ^X^X and re-normalize it as 


A = 




( 1 . 1 ) 


where I n is the identity matrix of order n. 

The first breakthrough regarding the ESD of A was made in Bai et al. [7] . They proved 
that with probability one 

F a (x)^-F(x), 

which is the so-called semicircle law with the density 


-r-v/4 — x 2 , if |x| < 2, 

2 7T 

0, if \x\ > 2. 


( 1 . 2 ) 


In random matrix theory, F(x) is referred to as the limiting spectral distribution (LSD) 
of A. For such matrices, Chen and Pan [10] proved that the largest eigenvalue converges 
to the right endpoint of the support of F(x) with probability one. When Xu ~ N( 0,1), 
Karoui [20] reported that the largest eigenvalue of XX T after properly centering and 
scaling converges in distribution to the Tracy-Widom law, and Birke and Dette [9] estab¬ 
lished central limit theorems for the quadratic function of the eigenvalues of A. Recently, 
Pan and Gao [24] further derived the LSD of a general form of (1.1), which is determined 
by its Stieltjes transform. Here, the Stieltjes transform for any distribution function G 
is given by 

m G {z) = f 1 dG'(z), S(z)>0, 

J x - z 

where 3(z) represents the imaginary part of 2 . 

Gaussian fluctuations in random matrices are investigated by different authors, starting 
with Costin and Lebowitz [12], Johansson [18] considered an extended random ensembles 
whose entries follow a specific class of densities and established a CLT of the linear 
spectral statistics (LSS). Recently, a CLT for LSS of sample covariance matrices is studied 
by Bai and Silverstein [5] and of Wigner matrices is studied by Bai and Yao [6] . 

We introduce some notation before stating our results. Denote the Stieltjes transform 
of the semicircle law F by m(z). 3 ( 2 ) is used to denote the imaginary part of a complex 
number 2 . For any given square matrix B, let trB and B denote the trace and the 
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complex conjugate matrix of B, respectively. The norm ||B|| represents the spectral 


norm of B, that is, ||B|| = y Ai(BB) where Ai(BB) means the maximum eigenvalue of 

BB. The notation means “convergence in distribution to”. Let 5? denote any open 
region on the real plane including [—2,2], which is the support of F(x), and ^ be the 
set of functions which are analytic on 5?. For any / £ .At. define 


G n (f) = n f f(x)d(F A (x)-F(x)) - <f f(-m-m 1 )X n (m)~ —^-d m, 
J-oo 2m J ]ml=p m 2 

(1.3) 


where 


X n (m) = 


-b + vb 2 -aac 

2A ’ 


A = m— \ / — (1+ to 2 ), 


B = to 2 — 1 — ,/ —to( 1 + 2 to 2 ) 


C =— 


1 — TO 2 


'^4-2 ) - J-m , 


(1.4) 


^4 = EXfi and \JB 2 — 4AC is a complex number whose imaginary part has the same 
sign as that of B. The integral’s contour is taken as |m| = p with p < 1. 

Let {Tfc} be the family of Chebyshcv polynomials, which is defined as To(x) = 
l,Tj(x) = x and Tfc+i(x) = 2 xT k (x) — T k -i(x). To give an alternative way of calculating 
the asymptotic covariance of X(f) in Theorem 1.1 below, for any / £ and any integer 
k > 0, we define 


*fc(/) = ^/ f( 2 cos 6 )e ike dd 


h 


f (2 cos 0 ) cos kd d 9 = 


7T 


f(2x)T k (x) 




: dx. 


The main result is formulated in the following. 


Theorem 1.1. Suppose that 

(a) X = (Xij) pxn where {Xij : i = 1,2,... ,p; j = 1,2,..., n} are i.i.d. real random 
variables with EX n = 0 ,EX 2 ± = 1 and v 4 = EX^ x < 00 . 

(bl) n/p —> 0 as n —> 00 . 

Then, for any / 1 , • • •, fk £ , the finite-dimensional random vector (G n (/i),..., G n (f k )) 

converges weakly to a Gaussian vector (Y(fi ),..., Y(fk)) with mean function EY (/) = 0 
and covariance function 

OO 

cov(F(/ 1 ),F(/ 2 )) = (i/ 4 - 3)* 1 (/ 1 )’i ’i(h) + 2£>* fc (/ 1 )* fc (/ 2 ) (1.5) 

k= 1 


4tt 2 


f[(x)f 2 (y)H(x,y)dxdy, 


—2 J -2 


( 1 . 6 ) 
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where 


H (. x, y) = ( 1/4 - 3) \/4 - x 2 \/A - y 2 + 2 log 


/ i-xy+y/{i- x 2 )(4 - y 2 ) \ 
\4 -xy- \/(4 — a; 2 )(4 — y 2 )) 


Remark 1.1. Note that X n (m) in (1.3) and X_ n (m) = B 4AC are the ^ w0 r0 °t s of 
the equation Ax 2 + Bx + C = 0. Since n/p —► 0, an easy calculation shows X n (m) = o(l) 
and X_ n (m ) = 1 ~ m —|-o(l). Hence in practice, one may implement the mean correction 
in (1.3) by taking 


X n (m) = min 


—B + VB 2 - 4 AC 


2 A 


-B - VB 2 - 4 AC 


2 A 


and m = pe ld with 0 £ [—27t, 27t] and 0 < p < 1. 

The mean correction term, the last term in (1.3), can be simplified when n 3 /p = 0(1). 
Indeed, if n 3 /p = 0(1), we have 4.4C = o(l), B = m 2 — 1. By (1.4), 


nX n (m) = n 


-B + VB 2 - 4.AC 


—2nC 


2 A 


B + VB 2 - 4.4C 


1 — m 2 V 1 — m. 


— 1/4 — 2 


1 ^ 4 

n° rn 


pi — m 2 


+ o(l). 


Hence, by using the same calculation as that in Section 5.1 of Bai and Yao [ 6 ], we have 


n f 

J\m\=) 


_ 777 .^ 

/(—m — m~ 1 )X n (m )-■=— dm 


= --{> 

J\m\=p 


f(—m — m 1 )m 


1 — 


— v 4 — 2 


= -l(/(2) + /(-2))-1^/(2*) 


1 n 3 f 1 4a ; 3 — 3x , 

/( 2 x)^==dx 
7T I/ P J~ 1 VI - Z 2 


2(i / 4 — 3)a ; 2 — ( Vi — 


dm + o(l) 
1 

\/l — x 2 


da; (1.7) 


|(/(2) + /(—2)) - l*o(/) + (1/4 - 3)tt 2 (/) 


-^3(/)+0(l). 


Define 


/■+°° /+T 

Qn(/)=n J f(x)d(F A (x)-F(x))-Jj* 3 (f). 


( 1 . 8 ) 
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Under the condition n 3 /p = 0(1), we then give a simple and explicit expression of the 
mean correction term of (1.3) in the following corollary. 


Corollary 1 . 1 . Suppose that 

(a) X = (Xij) pxn where {Xy : i = 1,2,... ,p; j = 1,2,..., n} are i.i.d. real random 
variables with EX n = 0, EXf ± = 1 and 1/4 = EXf x < 00 . 

(b 2 ) n 3 /p = 0(1) asn—> 00 . 

Then, for any / 1 , G ^, the finite-dimensional random vector (Q n (/i), ■ • •, Qn(fk)) 

converges weakly to a Gaussian vector (X(/i),.. .,X(/*,)) with mean function 

EX{f) = ±(/(2) + /(—2)) - (/) + («/ 4 - 3)* 2 (/) (1-9) 

and covariance function cov(X(/),X(g)) being the same as that given in (1.5) and (1-6). 


Remark 1.2. The result of Bai et al. [2] suggests that, for large p and n with p/n —> 00 , 
the matrix yjnA. is close toanxn Wigner matrix although its entries are not independent 
but weakly dependent. It is then reasonable to conjecture that the CLT for the LSS of 
A resembles that of a Wigner matrix described in Bai and Yao [ 6 ]. More precisely, by 
writing A= ^(t«jj), where wu = (sfsj —p)/y/p, uiij = sfSj / y/p for i^j and Sj is the 
jth column of X, we have 

Var(ion) = 1/4 — 1, Var(wi 2 ) = 1, E(w\ 2 - l ) 2 = -(^4 - 3). 

V 

Then, (1.9), (1.5) and (1.6) are consistent with (1.4), (1.5) and (1.6) of Bai and Yao [ 6 ], 
respectively, by taking their parameters as a 2 = V 4 — 1 , k = 2 (the real variable case) and 
8 = 0 . 

However, we remark that the mean correction term of Q n (f), the last term of ( 1 . 8 ), 
cannot be speculated from the result of Bai and Yao [ 6 ] . Note that this correction term 
will vanish in the case of the function / to be even or n 3 /p —X 0. By the definition of 
*&(/), one may verify that 


* 3 (/) = 



dx, 


VT~: 


: dx. 


Remark 1.3. If we interchange the roles of p and n, Birke and Dette [9] established 
the CLT for Q n (f) in their Theorem 3.4 when / = x 2 and Xy ~ iV(0,1). We below show 
that our Corollary 1.1 can recover their result. First, since / = x 2 is an even function, it 
implies that the last term of (1.8) is exactly zero. Therefore, the mean in Theorem 3.4 
of Birke and Dette [9] is the same as (1.9), which equals one. Second, the variance in 
Theorem 3.4 of Birke and Dette [9] is also consistent with (1.5). In fact, the variance 
of Birke and Dette [9] equals 4 when taking their parameter y = 0. On the other hand, 
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since Xjj ~ ZV(0,1), we have v 4 = 3 and the first term of (1.5) is zero. Furthermore, by a 
direct evaluation, we obtain that 

1 r n 1 r n 

U/i (/) = x— / 4cos 3 0d0 =— / (cos30 + 3cos0)d0 = O, 

2n J-7i 2n J-n 

1 r n 1 r n 

^2 (/) = — / 4 cos 2 0 cos 20 d0 = — / (cos40 +1+ 2cos20)d0= 1, 

J2tz J _ 7T 

1 r 71 1 r n 

'&k(f) = 7 r- / 4 cos 2 0 cos kd dd = — / 2(cos20 + 1) cos/c0d0 

J —^ 271 J _ 71 - 

1 C 

= — / (cos(fc — 2)0 + cos(fc + 2)0 + 2cosfc0) d0 = 0, for fc > 3. 

2nJ_ n 


It implies that cov(X(cc 2 ),X(a; 2 )) = 4, which equals the variance of Birke and Dette 


The main contribution of this paper is summarized as follows. We have established the 
central limit theorems of linear spectral statistics of the eigenvalues of the normalized 
sample covariance matrices when both the dimension and the sample size go to infinity 
with the dimension dominating the sample size (for the case p/n —> 00 ). Theorem 1.1 
and Corollary 1.1 are both applicable to the data with the dimension dominating the 
sample size while Corollary 1.1 provides a simplified correction term (hence, CLT) in the 
ultrahigh dimension cases (n 3 /p = 0(l)). Such an asymptotic theory complements the 
results of Bai and Silverstein [5] and Pan [23] for the case p/n —> c £ (0, 00 ) and Bai and 
Yao [6] for Wigner matrix. 

This paper is organized as follows. Section 2 provides a calibration of the mean cor¬ 
rection term in (1.3), runs simulations to check the accuracy of the calibrated CLTs in 
Theorem 1.1, and considers a statistical application of Theorem 1.1 and a real data anal¬ 
ysis. Section 3 gives the strategy of proving Theorem 1.1 and two intermediate results, 
Propositions 3.1 and 3.2, and truncation steps of the underlying random variables are 
given as well. Some preliminary results are given in Section 4. Sections 5 and 6 are de¬ 
voted to the proof of Proposition 3.1. We present the proof of Proposition 3.2 in Section 7. 
Section 8 derives mean and covariance in Theorem 1.1. 


2. Calibration, application and empirical studies 

Section 2.1 considers a calibration to the mean correction term of (1.3). A statistical 
application is performed in Section 2.2 and the empirical studies are carried out in Sec¬ 
tion 2.3. 

2.1. Calibration of the mean correction term in (1.3) 

Theorem 1.1 provides a CLT for G n (f) under the general framework p/n—> 00 , which 
only requires zero mean, unit variance and the bounded fourth moment. However, the 
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simulation results show that the asymptotic distributions of G n (/), especially the asymp¬ 
totic means, are sensitive to the skewness and the kurtosis of the random variables for 
some particular functions /, for example, f{x) = ^ x(x 2 — 3). This phenomenon is caused 
by the slow convergence rate of EG n (f ) to zero, which is illustrated as follows. Suppose 
that EX < oo. We then have \EG n (f)\ = O (y/n/p) + 0(1 /y/n) by the arguments in 
Section 6 . Also, the remaining terms (see (2.1) below) have a coefficient (y 4 — 1 )\Jn/p 
which converges to zero theoretically since p/n —> 00. However, if n = 100, p = n 2 and the 
variables Xij are from central exp(l) then (1/4 — 1 )yjn/p could be as big as 0 . 8 . 

In view of this, we will regain such terms and give a calibration for the mean correction 
term in (1.3). From Section 6 , we observe that the convergence rate of \EG n (f) \ relies on 
the rate of \nEco n — m 3 (z)(m'(z) + 1 /4 — 2)| in Lemma 6.1. By the arguments in Section 6 , 
only S22 below (6.13) has the coefficient {y4 — 1 )\Jn/p. A simply calculation implies that 

S22 = —2(1/4 - l)y / n/pm(z) + o(l). (2.1) 


Hence, the limit of nEu> n is calibrated as 

nEoj n = m 3 (z)[u 4 — 2 + m'(z) — 2 ( 1/4 — 1 )y/n/pm(z)] + o(l). (2.2) 


We then calibrate G n (f) as 


/ “TOO 

/ (x) d(F A (x) — F(x)) 

-OO 

/(-m-m- 1 )^ 15 ^) 

J \m\=p 

where, via ( 2 . 2 ), 


I f(—m — m~ 1 lA? alib ('m')-—^-d m, 




-S + VS 2 - 4 AC Calib 


^Calib _ 


2 A 


v 4 — 2 + 


1 — rn‘ 


— 2 ( 1/4 ~ \)m\fnfp 


- * / —m 
V 


(2.3) 


(2.4) 


A, B are defined in (1.4) and \JB 2 — 4AC CaIlb is a complex number whose imaginary part 
has the same sign as that of B. Theorem 1.1 still holds if we replace G n (f) with G(( allb (/). 

We next perform a simulation study to check the accuracy of the CLT in Theorem 1.1 
with G n (/) replaced by the calibrated expression G® allb (/) in (2.3). Two combinations 
of ( p,n ), p = n 2 ,n 2 5 , and the test function f(x) = \x{x 2 — 3) are considered in the 
simulations, as suggested by one of the referees. To inspect the impact of the skewness 
and the kurtosis of the variables, we use three types of random variables, 1V(0,1), central 
exp(l) and central f( 6 ). The skewnesses of these variables are 0, 2 and 0 while the fourth 
moments of these variables are 3, 9 and 6 , respectively. The empirical means and empirical 
standard deviations of G^ allb (/)/(Var(F (/))) 1 / 2 from 1000 independent replications are 
shown in Table 1. 
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Table 1. Empirical means of GS allb (/)/(Var(y(/))) 1 / 2 (cf. (2.3)) for the function f(x) = 
| x(x 2 — 3) with the corresponding standard deviations in the parentheses 


n 

50 

100 

150 

200 

N( 0,1) 
exp(l) 

t{ 6) 

p = n 2 

-0.314 (1.227) 
-0.088 (2.476) 
-0.084 (2.813) 

„ „2.5 

-0.221 (1.038) 
-0.079 (1.447) 
-0.077 (1.541) 

-0.188 (1.051) 
-0.140 (1.400) 
-0.095 (1.246) 

-0.093 (0.940) 
-0.161 (1.154) 
-0.0897 (1.104) 

N( 0,1) 
exp(l) 
t(6) 

p — n 

-0.068 (1.049) 
-0.049 (1.879) 
-0.075 (1.693) 

-0.053 (1.077) 
-0.029 (1.390) 
0.050 (1.252) 

-0.0476 (0.944) 
-0.046 (1.162) 
-0.044 (1.145) 

-0.016 (1.045) 
-0.045 (1.156) 
-0.027 (1.044) 


It is observed from Table 1 that both the empirical means and standard deviations 
for N( 0,1) random variables are very accurate. The empirical means for central exp(l) 
and central t(6) also show their good accuracy. We note that the standard deviations 
for central exp(l) and central f(6) random variables are not good when n is small (e.g., 
n = 50). But it gradually tends to 1 as the sample size n increases. 

Q-Q plots are employed to illustrate the accuracy of the normal approximation in 
Figures 1 and 2 corresponding to the scenarios p = n 2 and p = n 2 ' 5 , respectively. In each 
figure, Q-Q plots from left to right correspond ton = 50,100,150,200, respectively with 
random variables generated from 1V(0,1) (V), central exp(l) (A) and central t(6 ) (+). 
We observe the same phenomenon that the normal approximation is very accurate for 
normal variables while the approximation is gradually better when n increases for central 
exp(l) and f(6) variables. 


n=50, N(0,1)(v), exp(1)( ),t(6)(+) 


n=100, N(0,1)(v), exp(1)( ), t(6)(+) 


n=200, N(0,1)(v),exp(1)( ), t(6)( ) 





Figure 1. The Q-Q plots of the standard Gaussian distribution versus 
GS ahb (/)/(Var(y(/))) 1 A based on the sample generating from 1V(0,1) (V), standardized 
exp(l) (A) and standardized t(6) (+) with the sample sizes n = 50,100,200 from left to right 
and the dimension p = n 2 . 
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n=50, N(0,1)(v), exp(1)( ), t(6)(+) 


n=100, N(0,1)(v), exp(1)( ), t(6)(+) 


n=200, N(0,1)(v), exp(1)( ), t(6)(+) 



-2 


-3 -2 -1 


-3 - 2-10 1 

Normal quantile 


Normal quantile 


Normal quantile 


Figure 2. The Q-Q plots of the standard Gaussian distribution versus 
G'S allb (/)/(Var(l ^(/))) 1 / 2 based on the sample generating from AT(0,1) (V), standardized 
exp(l) (A) and standardized t( 6 ) (+) with the sample sizes n = 50,100,200 from left to right 
and the dimension p = n 2 ' 5 . 

2.2. Application of CLTs to hypothesis test 

This subsection is to consider an application of Theorem 1.1 which is about hypothesis 
testing for the covariance matrix. Suppose that y = Ts is a p-dimensional vector where T 
is a p x p matrix with positive eigenvalues and the entries of s are i.i.d. random variables 
with mean zero and variance one. Hence, the covariance matrix of y is X = TT t . Suppose 
that one wishes to test the hypothesis 


(2.5) 


H 0 :Z = I pi -Hi: £ 7 ^ Ip- 


Based on the i.i.d. samples yi, ... ,y n (from y), many authors have considered (2.5) in 
terms of the relationship of p and n. For example, John [19] and Nagao [22] considered 
the fixed-dimensional case; Ledoit and Wolf [21], Fisher et al. [16] and Bai et al. [2] 
studied the case of ^ —>■ c S (0, oo); Srivastava [26], Srivastava, Kollo and von Rosen [27], 
Fisher [15] and Chen et al. [11] proposed the testing statistics which can accommodate 
large p and small n. 

We are interesting in testing (2.5) in the setting of £ —> oo. As in Ledoit and Wolf [21] 
and Birke and Dette [9] , we set f = x 2 . We then propose the following test statistic for 
the hypothesis of (2.5): 



( 2 . 6 ) 



where X% alih {m) is given in (2.4) and B = ,/f (±Y T Y - I n ) is the normalized sample 
covariance matrix with Y = (yi, ... ,y n ). The asymptotic mean and variance of L n are 
0 and 1, respectively, see Theorem 1.1 or Remark 1.3 for details. Since there is no close 
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form for the mean correction term in (2.6), we use Matlab to calculate this correction 
term. It shows that as n/p —> 0, 

——<1 (m + 7 n^ 1 ) 2 A^ ahb (m)-—dm = ^4 — 2 . 

2 m /| m | =p m 2 


We also note the fact that 
E 




= L/trBB 7 — n] = 1/4 — 2 . 


Thus, we use the following test statistic in the simulations: 


In — 


1 


J x 2 d F B (x) — J x 2 dF(x) \ — (z /4 — 2 ) 


= i(trBB T -n-( i/ 4 -2)). (2.7) 


Since T t T = I p is equivalent to TT t = I p , under the null hypothesis H 0 in (2.5), we have 


L n ^N( 0,1). (2.8) 

By the law of large numbers, a consistent estimator of 1/4 is P 4 = j ^ under the null 

hypothesis Uo- By Slutsky’s theorem, (2.8) also holds if we replace 1/4 of (2.7) with P 4 . 

The numerical performance of the proposed statistic L n is carried out by Monte Carlo 
simulations. Let Z a / 2 and Zi_ a / 2 , respectively, be the lOOct/2% and 100(1 — a/2)% 
quantiles of the asymptotic null distribution of the test statistic L n . With T replications 
of the data set simulated under the null hypothesis, we calculate the empirical size as 

. {#ir'‘ < z a/2 } +{#t“ uU > z x _ a/2 } 

a =- - -—- - —, 

T 

where # denotes the number and T ™ 11 represents the values of the test statistic L n based 
on the data set simulated under the null hypothesis. The empirical power is calculated by 

3 _ {#if er < Z a/2 } + {#Lf er > Z x _ a/2 } 

P jn 


where Lj) lter represents the values of the test statistic L n based on the data set simulated 
under the alternative hypothesis. In our simulations, we fix T = 1000 as the number of 
replications and set the nominal significance level a = 5%. By asymptotic normality, we 
have Z a /2 = —1.96 and ^i-a /2 = 1.96. 

Our proposed test is intended for the situation “large p, small n”. To inspect the 
impact caused by the sample size and/or the dimension, we set 


n = 20,40,60,80, 

p = 600,1500,3000,5500,8000,10 000. 


The entries of s are generated from three types of distributions, Gaussian distribution, 
standardized Gamma(4, 0.5) and Bernoulli distribution with P(Xij = ±1) = 0.5. 
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The following two types of covariance matrices are considered in the simulations to 
investigate the empirical power of the test. 

1. (Diagonal covariance.) E = diag(v / 21[ I / P ], li_[„ p ]), where v = 0.08 or v = 0.25, [a] 
denotes the largest integer that is not greater than a. 

2. (Banded covariance.) E = diag(Zli,diag(l p _r„ 2P ])), where A\ is a [v 2 p] x [v 2 p\ tridi¬ 
agonal symmetric matrix with the diagonal elements being equal to 1 and elements 
below and above the diagonal all being equal to v\. 

Since the test in Chen et al. [11] accommodates a wider class of variates and has less 
restrictions on the ratio p/n, we below compare performance of our test with that of 
Chen et al. [11]. To simplify the notation, denote their test by the CZZ test. Table 2 
reports empirical sizes of the proposed test and of the CZZ test for the preceding three 
distributions. We observe from Table 2 that the sizes of both tests are roughly the same, 
when the underlying variables are normally or Bernoulli distributed. It seems that the 
CZZ test looks better for skewed data, for example, gamma distribution. We believe 


Table 2. Empirical sizes of CZZ test and L n at the significant level a = 5% for normal, gamma, 
Bernoulli random vectors 


p 

CZZ test 

n 



L n 

n 




20 

40 

60 

80 

20 

40 

60 

80 


Normal random vectors 






600 

0.069 

0.071 

0.052 

0.052 

0.063 

0.077 

0.066 

0.082 

1500 

0.057 

0.059 

0.061 

0.059 

0.055 

0.058 

0.058 

0.062 

3000 

0.067 

0.068 

0.057 

0.053 

0.048 

0.067 

0.056 

0.052 

5500 

0.064 

0.06 

0.067 

0.058 

0.054 

0.055 

0.071 

0.068 

8000 

0.071 

0.062 

0.062 

0.054 

0.055 

0.049 

0.06 

0.059 

10 000 

0.055 

0.059 

0.063 

0.06 

0.037 

0.058 

0.057 

0.054 


Gamma 

random vectors 






600 

0.055 

0.073 

0.056 

0.062 

0.103 

0.119 

0.125 

0.123 

1500 

0.064 

0.047 

0.059 

0.059 

0.094 

0.072 

0.072 

0.088 

3000 

0.069 

0.071 

0.059 

0.052 

0.066 

0.074 

0.071 

0.061 

5500 

0.065 

0.069 

0.048 

0.041 

0.077 

0.073 

0.047 

0.045 

8000 

0.069 

0.065 

0.07 

0.053 

0.078 

0.075 

0.063 

0.059 

10 000 

0.072 

0.06 

0.06 

0.057 

0.078 

0.082 

0.065 

0.06 


Bernoulli random vectors 






600 

0.078 

0.079 

0.056 

0.037 

0.048 

0.064 

0.046 

0.037 

1500 

0.065 

0.050 

0.051 

0.053 

0.039 

0.040 

0.049 

0.050 

3000 

0.048 

0.053 

0.058 

0.060 

0.040 

0.052 

0.052 

0.056 

5500 

0.059 

0.061 

0.059 

0.042 

0.040 

0.052 

0.060 

0.040 

8000 

0.065 

0.074 

0.065 

0.059 

0.046 

0.052 

0.05 

0.051 

10 000 

0.07 

0.057 

0.047 

0.048 

0.044 

0.037 

0.038 

0.047 
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additional corrections such as the Edgeworth expansion will be helpful, which is beyond 
the scope of this paper. However, our test still performs well for skewed data if p 3> n. 

Table 3 to Table 5 summarize the empirical powers of the proposed tests as well as 
those of the CZZ test for both the diagonal and the banded covariance matrix. Table 3 


Table 3. Empirical powers of CZZ test and L n at the significant level a = 5% for normal 
random vectors. Two types of population covariance matrices are considered. In the first case, 
Ei = diag(2 x l[„ p ], l p -[i,p]) for u = 0.08 and v = 0.25, respectively. In the second case, E 2 = 
diag(Ti,diag(lj,_r„ 2 j,])), where Ai is a [v 2 p] x [V 2 p] tridiagonal symmetric matrix with diagonal 
elements equal to 1 and elements beside diagonal all equal to Vi for vi = 0.5, V 2 = 0.8 and 
vi = 0.5, V 2 = 0.4, respectively 


p 

CZZ test 

n 



Ln 

n 




20 

40 

60 

80 

20 

40 

60 

80 


Normal 

random vectors (u - 

= 0.08) 





600 

0.186 

0.392 

0.648 

0.826 

0.932 

1 

1 

1 

1500 

0.179 

0.397 

0.642 

0.822 

0.999 

1 

1 

1 

3000 

0.197 

0.374 

0.615 

0.867 

1.000 

1 

1 

1 

5500 

0.225 

0.382 

0.615 

0.85 

1 

1 

1 

1 

8000 

0.203 

0.391 

0.638 

0.843 

1 

1 

1 

1 

10 000 

0.204 

0.381 

0.639 

0.835 

1 

1 

1 

1 


Normal 

random vectors (y = 

= 0.25) 





600 

0.571 

0.952 

0.997 

1 

1 

1 

1 

1 

1500 

0.585 

0.959 

1.000 

1 

1 

1 

1 

1 

3000 

0.594 

0.961 

1.000 

1 

1 

1 

1 

1 

5500 

0.617 

0.954 

1 

1 

1 

1 

1 

1 

8000 

0.607 

0.957 

0.999 

1 

1 

1 

1 

1 

10 000 

0.595 

0.949 

1 

1 

1 

1 

1 

1 


Normal 

random vectors (ui 

= 0.5, U 2 = 

0.8) 




600 

0.333 

0.874 

0.997 

1 

0.443 

0.493 

0.492 

0.488 

1500 

0.310 

0.901 

0.999 

1 

0.987 

0.997 

0.997 

0.998 

3000 

0.348 

0.889 

0.998 

1 

1.000 

1.000 

1.000 

1.000 

5500 

0.382 

0.871 

0.998 

1 

1 

1 

1 

1 

8000 

0.33 

0.867 

0.998 

1 

1 

1 

1 

1 

10 000 

0.359 

0.868 

0.998 

1 

1 

1 

1 

1 


Normal 

random vectors (ui 

= 0.5, u 2 = 

0.4) 




600 

0.142 

0.364 

0.668 

0.896 

0.078 

0.089 

0.069 

0.102 

1500 

0.131 

0.354 

0.653 

0.890 

0.220 

0.235 

0.230 

0.226 

3000 

0.139 

0.361 

0.662 

0.899 

0.635 

0.660 

0.647 

0.684 

5500 

0.148 

0.352 

0.645 

0.898 

0.97 

0.979 

0.989 

0.989 

8000 

0.152 

0.36 

0.688 

0.905 

0.981 

0.978 

0.986 

0.989 

10 000 

0.137 

0.328 

0.674 

0.886 

1 

1 

1 

1 
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Table 4. Empirical powers of CZZ test and L n at the significant level a = 5% for standardized 
gamma random vectors. Two types of population covariance matrices are considered. In the first 
case, Ei = diag(2 x l[„ p ], l p -[„ p ]) for v = 0.08 and v = 0.25, respectively. In the second case, 
E 2 = diag(y4i,diag(l p _[„ 2P ])), where Ai is a [V 2 p] x [ 112 p\ tridiagonal symmetric matrix with 
diagonal elements equal to 1 and elements beside diagonal all equal to v\ for vi = 0.5, V 2 = 0.8 
and vi = 0.5, V 2 = 0.4, respectively 


p 

CZZ test 

n 


L n 

n 




20 

40 

60 

80 20 

40 

60 

80 


Gamma 

random vectors (v - 

= 0.08) 




600 

0.331 

0.638 

0.891 

0.982 0.999 

1 

1 

1 

1500 

0.356 

0.636 

0.901 

0.979 1 

1 

1 

1 

3000 

0.197 

0.383 

0.638 

0.823 1 

1 

1 

1 

5500 

0.178 

0.361 

0.658 

0.845 1 

1 

1 

1 

8000 

0.199 

0.399 

0.642 

0.85 1 

1 

1 

1 

10 000 

0.216 

0.353 

0.636 

0.843 1 

1 

1 

1 


Gamma 

random vectors (y - 

= 0.25) 




600 

0.621 

0.943 

1.000 

1 1 

1 

1 

1 

1500 

0.610 

0.946 

0.999 

1 1 

1 

1 

1 

3000 

0.579 

0.946 

0.997 

1 1 

1 

1 

1 

5500 

0.596 

0.957 

0.999 

1 1 

1 

1 

1 

8000 

0.616 

0.962 

0.999 

1 1 

1 

1 

1 

10 000 

0.614 

0.955 

0.999 

1 1 

1 

1 

1 


Gamma 

random vectors (iq 

= 0.5, V 2 = 0.8) 




600 

0.192 

0.871 

0.998 

0.972 0.122 

0.413 

0.423 

0.133 

1500 

0.198 

0.883 

0.995 

0.980 0.440 

0.992 

0.993 

0.433 

3000 

0.343 

0.885 

0.995 

1 1 

1 

1 

1 

5500 

0.342 

0.88 

0.996 

1 1 

1 

1 

1 

8000 

0.349 

0.877 

0.998 

1 1 

1 

1 

1 

10 000 

0.337 

0.879 

0.998 

1 1 

1 

1 

1 


Gamma 

random vectors ( 1/1 

= 0.5, V 2 = 0.4) 




600 

0.117 

0.353 

0.650 

0.780 0.087 

0.111 

0.114 

0.120 

1500 

0.138 

0.365 

0.661 

0.799 0.183 

0.215 

0.226 

0.157 

3000 

0.129 

0.349 

0.646 

0.89 0.593 

0.621 

0.627 

0.61 

5500 

0.124 

0.335 

0.678 

0.889 0.945 

0.972 

0.981 

0.986 

8000 

0.142 

0.369 

0.668 

0.901 0.999 

1 

1 

1 

10 000 

0.142 

0.336 

0.668 

1 1 

1 

1 

1 


assumes the underlying variables are normally distributed while Tables 4 and 5 assume 
the central gamma and the central bernoulli random variables, respectively. For the 
diagonal covariance matrix, we observe that the proposed test consistently outperforms 
the CZZ test for all types of distributions, especially for “small” n. For example, when 
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Table 5. Empirical powers of CZZ test and L n at the significant level a = 5% for standardized 
Bernoulli random vectors. Two types of population covariance matrices are considered. In the 
first case, Ei = diag(2 x l\ U p], lj,_[„ p ]) f° r v = 0-08 and v = 0.25, respectively. In the second case, 
E 2 = diag(Ti,diag(l p _[^ 2P ])), where Ai is a [V2p] x [112p] tridiagonal symmetric matrix with 
diagonal elements equal to 1 and elements beside diagonal all equal to v\ for vi = 0.5, V 2 = 0.8 
and vi = 0.5, V 2 = 0.4, respectively 



CZZ test 

n 




L n 

n 




p 

20 

40 

60 

80 

20 

40 

60 

80 

600 

Bernoulli 

0.216 

random 

0.381 

vectors (u - 
0.622 

= 0.08) 
0.849 

0.972 

1 

1 

1 

1500 

0.198 

0.401 

0.632 

0.837 

1 

1 

1 

1 

3000 

0.203 

0.362 

0.622 

0.823 

1 

1 

1 

1 

5500 

0.196 

0.354 

0.627 

0.829 

1 

1 

1 

1 

8000 

0.203 

0.373 

0.638 

0.834 

1 

1 

1 

1 

10 000 

0.213 

0.397 

0.637 

0.822 

1 

1 

1 

1 

600 

Bernoulli 

0.594 

random 

0.952 

vectors (v = 
0.998 

= 0.25) 

1 

1 

1 

1 

1 

1500 

0.619 

0.960 

1.000 

1 

1 

1 

1 

1 

3000 

0.594 

0.964 

0.999 

1 

1 

1 

1 

1 

5500 

0.609 

0.948 

1.000 

1 

1 

1 

1 

1 

8000 

0.589 

0.952 

1 

1 

1 

1 

1 

1 

10 000 

0.603 

0.957 

0.999 

1 

1 

1 

1 

1 

600 

Bernoulli 

0.356 

random 

0.870 

vectors (iq 
0.996 

= 0.5,u 2 = 

1 

0.8) 

0.507 

0.512 

0.526 

0.558 

1500 

0.359 

0.892 

0.995 

1 

0.999 

1 

1 

0.999 

3000 

0.343 

0.877 

0.998 

1 

1.000 

1 

1 

1.000 

5500 

0.355 

0.868 

0.997 

1 

1.000 

1.000 

1.000 

1.000 

8000 

0.332 

0.873 

0.997 

1 

1 

1 

1 

1 

10 000 

0.353 

0.872 

1 

1 

1 

1 

1 

1 

600 

Bernoulli 

0.153 

random 

0.348 

vectors (m 
0.643 

= 0.5,u 2 = 
0.901 

0.4) 

0.092 

0.086 

0.079 

0.085 

1500 

0.154 

0.372 

0.643 

0.878 

0.239 

0.255 

0.235 

0.241 

3000 

0.141 

0.339 

0.649 

0.882 

0.682 

0.680 

0.680 

0.674 

5500 

0.156 

0.343 

0.656 

0.893 

0.997 

0.994 

0.994 

0.994 

8000 

0.144 

0.353 

0.664 

0.904 

1 

1 

1 

1 

10 000 

0.139 

0.356 

0.685 

0.889 

1 

1 

1 

1 


n = 20, even n = 40,60,80 for v = 0.08, the CZZ test results in power ranging from 
0.2-0.8, while our test still gains very satisfying power exceeding 0.932. 

For the banded covariance matrix, we observe an interesting phenomenon. Our test 
seems to be more sensitive to the dimension p. When p = 600,1500,3000, the power of 
our test is not that good for small V 2 (= 0.4). Fortunately, when p = 5500,8000,10 000, 
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the performance is much better, where the power is one or close to one. Similar results 
are also observed for V 2 =0.8. We also note that large V 2 outperforms smaller V 2 because 
when V 2 becomes larger, the corresponding covariance matrix becomes more “different” 
from the identity matrix. As for the CZZ test, its power is mainly affected by n. But 
generally speaking, our test gains better power than the CZZ test for extremely larger p 
and small n. 


2.3. Empirical studies 

As empirical applications, we consider two classic datasets: the colon data of Alon et 
al. [1] and the leukemia data of Golub et al. [17]. Both datasets are publicly available 
on the web site of Tatsuya Kubokawa: http://www. tatsuya. e .u-tokyo. ac . jp/. Such 
data were used in Fisher [15] as well. The sample sizes and dimensions (n,p) of the colon 
data and the leukemia data are (62,2000) and (72,3571), respectively. Simulations show 
that these two datasets have zero mean (10 -8 to 10 -11 ) and unit variance. Therefore, we 
consider the hypothesis test in (2.5) by using the test statistic L n in (2.7). The computed 
values are L n = 33 933.7 for the colon data and L n = 60 956 for the leukemia data. It is 
also interesting to note that the statistic values of Fisher [15] are 6062.642 for the colon 
data and 6955.651 for the leukemia data when testing the identity hypothesis. Also, the 
statistics of Fisher [15] and L n in (2.7) are both asymptotic normality (standard normal). 
As in Fisher [15], we conclude that p -values of the test statistics are zero which shows 
evidence to reject the null hypothesis. This is consistent with Fisher’s [15] conclusion for 
these two datasets. 


3. Truncation and strategy for the proof of 
Theorem 1.1 

In the rest of the paper, we use K to denote a constant which may take different values 
at different places. The notation ol (1) stands for a term converging to zero in L p norm; 

a " s -> means “convergence almost surely to”; lp ' - > means “convergence in probability to”. 

3.1. Truncation 

In this section, we truncate the underlying random variables as in Pan and Gao [24]. 
Choose S n satisfying 

lirn S~ 4 E\X n \ 4 I(\X n \> Sntfnp) = 0, S n f 0, d n fffvp | oo. (3.1) 

n—> oo 

In what follows, we will use 6 to represent S n for convenience. We first truncate the 
variables Ay = Ay/(|Ayj < Stfnp) and then normalize it as Ay = (Ay — £Ay)/er, 
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where a is the standard deviation of X^. Let X = (X t j) and X = ( Xjj ). Define A, A and 
G n (f),G n (f) similarly by means of (1.1) and (1.3), respectively. We then have 

P{A 7 ^ A) < npP(\Xu\ > 5^/rvp) < KS~ 4 E\X 11 \ 4 I(\Xi 1 \ > 8 -tfnp) =o(l). 

It follows from (3.1) that 

|1 - ct 2 | < 2 \EX 2 ll I(\X ll > 5f/rvp\)\ 

< 2(np)~ 1 / 2 S~ 2 E\X 11 \ 4 I(\X 11 \ > Stfnp) =o((np)~ 1/2 ) 

and 

\EX n \< S-^npy^ElXnfHlXnl > StfEp) = o ((np)" 3 / 4 ). 

Therefore 

Atr(X - X) T (X - X) <Y^,E\Xij - X y | 2 

h3 

< Kpn ^ 1 ~ 2 g)2 A|A n | 2 + ^EX^ =o(l) 

and 

E trX T X <J2 E \Xij \ 2 < Kn 'P: E trXX T < ^EIX^ < Knp. 

Recalling that the notation A j(-) represents the jth largest eigenvalue, we then have 
Aj(X 2 X) = y/np\j(A) +p. Similar equalities also hold if X, A are replaced by X, A or 
X, A. Consequently, applying the argument used in Theorem 11.36 in Bai and Silverstein 
[3] and Cauchy-Schwarz’s inequality, we have 

n 

E\G n {f) - G n (f )| < Y^EmXjiA)) - f(Xj(A))\ 
l=i 

n n 

< Kf^ElX^A) - Aj(A)| <-L_Y,E\Xj(X T X) - A,(X T X)| 

l=i ^ P l=i 

< -^-Mtr(X - X) T (X - X) • 2(tr X T X + trX T X )] 1/2 

y/rvp 

< ■^[Atr(X-X) T (X-X) • (E tr X T X + A tr X T X )] 1/2 

y/rvp 

= o(l), 

where Kf is a bound on \f'(x)\. Thus, the weak convergence of G n (f) is not affected 
if we replace the original variables A,;, by the truncated and normalized variables Xij. 
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For convenience, we still use X i3 to denote X i3 , which satisfies the following additional 
assumption (c): 

(c) The underlying variables satisfy 

\X i3 \ <5#np, EX ij = 0, EXfj = 1, EXfj = v 4 + o(l), 
where 5 = 5 n satisfies lim^oo <5~ 4 1S|Xli| 4 J(|Xli| > 5 n tfnp) = 0 , 5 n f 0 , and 

Sntfnp too- 

For any e > 0, define the event F n (e) = {maxj< n |Ay (A)| > 2 + e} where A is defined 
by the truncated and normalized variables satisfying assumption (c). By Theorem 2 in 
Chen and Pan [10], for any l > 0 

P(F n (e))=ofn- 1 ). (3.2) 

Here we would point out that the result regarding the minimum eigenvalue of A can be 
obtained similarly by investigating the maximum eigenvalue of —A. 

3.2. Strategy of the proof 

We shall follow the strategy of Bai and Yao [ 6 ]. Specifically speaking, assume that uq,v 
are fixed and sufficiently small so that ? C =5^ (see the definition in the introduction), 
where c is the contour formed by the boundary of the rectangle with (±uo,±iu) where 
u o >2,0<u<l. By Cauchy’s integral formula, with probability one, 

G n (f) = -^ f(z)n[m n (z) - m(z) - X n (m{z))\ dz, 

where m n (z),m(z) denote the Stieltjes transform of F A (x) and F(x), respectively. 

Let 

M n (z) = n[m n (z) - m(z) - X n (m(z))], zG<;. 

For zg;, write M n (z) = Mn\z) + M„ 2 \z) where 

M n\ z ) = n[m n (z) - Em n (z)], M^ 2 \z) = n[Em n {z) - m(z) - X n (m(z))]. 

Split the contour c as the union of c„,Q,c r , Co where Q = {z = —uq + m,£ n n -1 < |v| < 
ui},<r r = {z = u 0 + i v,£ n n~ l < |u| < Vi},co = {z = ±u 0 + iu, M < £ n u -1 } and <?„ = {z = 
udz ivi, |iz| < uq} and where £„ is a slowly varying sequence of positive constants and V\ 
is a positive constant which is independent of n. Throughout this paper, let Ci = {z: z = 
u + iv,u£ [—uo,uo],|u| > ui}. 


Proposition 3.1. Under assumptions (bl), (c), the empirical process {M„(z),z£ Ci} 
converges weakly to a Gaussian process {M[z),z £ Ci} with the mean function 


A{z)=0 


( 3 . 3 ) 
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and the covariance function 

A( 2 i, 2 2 ) = m'{zi)m'( z 2 )[va - 3 + 2(1 - m(zi)m(z 2 )y 2 ]- (3.4) 

As in Bai and Yao [ 6 ], the process of { M(z ), z £ Ci} can be extended to {M(z ). Sft(z) ^ 
[— 2 , 2 ]} due to the facts that (i) M(z) is symmetric, for example, M(z) = M(z); (ii) the 
mean and the covariance function of M(z) are independent of v± and they are continuous 
except for ^ [—2,2], By Proposition 3.1 and the continuous mapping theorem, 

“ 2 “- / f(z)M n (z)dz f f{z)M(z)dz. 

Thus, to prove Theorem 1.1, it is also necessary to prove the following proposition. 

Proposition 3.2. Let z €C±. Under assumptions (bl), (c), there exists some event U n 
with P(U n ) —> 0, asn-> oo, such that 


and 


lim lim sup E 

^ilO n—yoo 


lim lim sup 

w l!0 n—too 


^ Uj={ r.O ^ 


M^\z)I{U c n )dz 


Ui=I r,0 ^ 


EM n (z)I(JJ*)dz 


= 0 , 


= 0 


lim E 

witO 


M^(z)dz 


= 0, lim E 

V!lO 


M{z)dz 


= 0 . 


(3.5) 

(3.6) 

(3.7) 


Since E\M^(z)\ 2 = A (z,z) and E\M{z)\ 2 = A (z,z) + \EM(z)\ 2 , (3.7) can be easily 
obtained from Proposition 3.1. For i = 0, if we choose U n = F n (e) with the e = (uq — 2)/2, 
then when f/“ happens, Mz £ co, we have |m„( 2 )| < 2 /(uq — 2 ) and \m(z)\ < 1 /(uq — 2 ). 
Thus 


[ My\ Z )i(u c n 

j < 7 n 


dz 


< n 


Mo - 2 


"ikoll < 


(mo-2)2’ 


where 1111 represents the length of Co- Furthermore, 


M n (z)I(Uff)dz 


< n 


2 1 n 

-1- \- K— 

Mo — 2 Mo — 2 p 


These imply that (3.6) and (3.5) are true for z € % by noting that —> 0 as p —> oo. 

Sections 5 and 6 are devoted to the proof of Proposition 3.1. The main steps are 
summarized in the following: 

• According to Theorem 8.1 in Billingsley [ 8 ], to establish the convergence of the 
process {M n (z),z € Ci}, it suffices to prove the finite-dimensional convergence of 
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the random part M^ ( z ) and its tightness, and the convergence of the non-random 
part Mn\z). 

• For the random part Mn\z), we rewrite it in terms of a martingale expression so 
that we may apply the central limit theorem of martingales to find its asymptotic 
mean and covariance. 

(‘ 2 ') 

• For the non-random part Mn '(z), by the formula of the inverse of a matrix and the 
equation satisfied by m(z) we develop an equation for ( Em n (z) — m(z)). Based on it, 
we then find its limit under assumptions n/p—> 0 and n 3 /p = 0(1) for Theorem 1.1 
and Corollary 1.1, respectively. 

Section 7 uses Lemma 4.4 below to finish the proofs of (3.5) for i = l,r so that the 
proof of Proposition 3.2 is completed. Section 8 uses Bai and Yao’s [6] asymptotic mean 
and covariance function to conclude the proof of Theorem 1.1. 


4. Preliminary results 


This section is to provide simplification of Mn\z) and some useful lemmas needed to 
prove Proposition 3.1. 


4.1. Simplification of M^(z) 


The aim of this subsection is to simplify M^\z) so that Mn\z ) can be written in the 
form of martingales. Some moment bounds are also proved. 

Define D = A — zl n . Let s*, be the fcth column of X and be a p x (n — 1 ) 
matrix constructed from X by deleting the fcth column. We then similarly define 
Ak = -^(X^X/; — pl ra _i) and Dj, = A a, — zI n -±. The fcth diagonal element of D 

is a^ fc as = -^=p (si s k — p) — z and the fcth row of D with the fcth element deleted is 
= -A=s^Xfe. The Stieltjes transform of F A has the form m n (z) = ^trD -1 . The 
limiting Stieltjes transform m(z) satisfies 

m(z) = - 1 , \m(z)\ < 1 (4.1) 

2 + m[z) 


(one may see Bai and Yao [6]). 

Define the cr-field Tk = <r(si,S 2 ,... ,s*,) and the conditional expectation Ek(-) = 
E{-\Fk). By the matrix inversion formula, we have (see (3.9) of Bai [4]) 


(i + q^DfeV) 

-a£? s + q£D*V 


tr(D _1 - D" 1 ) 


(4.2) 
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We then obtain 

n n 

M£\z) = trD- 1 - -EtrD” 1 = J2( E k ~ E k-\) tr(D _1 - D^ 1 ) = ^ g k (4.3) 

k= 1 k= 1 

= {E k ~ Ek~i)tk ~ E k K k , (4.4) 

where 


g k = -{E k - Sfc_i)/3 fe (l + q^D fc 2 q fc ), 
tfc = ~Pk PkVk{l + qfc 2 q fc ), 


= -^(SfeSfc -p) -7*1, 

y/np 


Pk 


1 

z+ (l/(np))trMj^ 


Pk = 

M i 8) 


- a tt g + qfe D fc 
= X fe D- s X^s = l,2, 


7fes = qfc D fc s q fc - {np) 1 tr M[ s) , K fc = /3f ■y fc2 • 


In the above equality, p*, is obtained by (4.2) and the last equality uses the facts that 


Pk = P k + PkPkPk 


(4.5) 


and 


{Ek ~ E k _ i) 


1 

np 


Pk{ 1 + ^trMf 


= 0 , 


Ek-iKk = 0 . 


We remind the readers that the variable z has been dropped from the expressions 
such as D-^Dfc ,fi k , 7 ks and so on. When necessary, we will also indicate them as 
B~ 1 {z),'D^ 1 {z),Pk{z), 7 ks {z), etc. 

We next provide some useful bounds. It follows from the definitions of D and D*, that 


D" 1 X t X = pD ” 1 + ^p{I n + zD” 1 ), 
D^X^X = pD ” 1 + V^(In-i + zB- 1 ). 


(4.6) 


Since the eigenvalues of D 1 have the form l/(Aj(A) — z), ||D 1 || < \/v\ and similarly 
HE )” 1 || < l/v\. From Theorem 11.4 in Bai and Silverstein [3], we note that —/3 k {z) is the 
fcth diagonal element of D ” 1 so that \/3 k \ < l/v\. Moreover, considering the imaginary 
parts of 1/P k and 1 //3 k and by (4.6) we have 


and 


\Pk\ < 1/wi, 


1 + — tr M 

np 


M 


<(1 + 1 /vl s ), 


s = 1,2 


(4.7) 


l(i + q^ D fc 2 <\ k )Pk\ < 


i + q^ D fc lD /qfc 

Ui(l + q|’D^ 1 D fe 1 q fc ) 


1 /ui. 


(4.8) 
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Applying (4.5), we split i k as 

ifc = _ f 1 + fZ trM i 2) ) Wkfvk - Iki (Pkfvk - fl + ^-qZ'Df 2 qk) ll 

\ J \ 't'P / 

= t/cl + t/c2 + tfc3- 


As will be seen, i k i,L k 2 could be negligible by Lemma 4.1 below. 
By Lemma 4.1, (4.7) and (4.8), we have 


E 


^(-Efc — Ek-i)ik3 

k =1 



2 

< A'<5 4 , 


and that 


E 


^(Afc — Ek-i)tk2 


fe=l 


n n jr 

<J2 E MPkfvk\ 2 < Kj2(E\ lkl \ 4 E\r, k \ 4 ) 1/2 < —- +KS 2 . 

k= 1 fc=l ^ 


Therefore, M^ 1 ' 1 


(z) is simplified as 


/c=l 


1+ — tr M 
np 


( 2 ) 


(/3fc r ) Vk - Kfc 


°i 2 ( 1 ) 


= ^A fe (a fc (z))+o L2 ( 1 ), 
k =1 


(4.9) 


where a k (z) represents the term in the square bracket. Thus, to prove finite-dimensional 
convergence of Mn\z),z £ Ci we need only consider the sum 

In n l 

E^E ■^kiS^ki^jY) — EE aj E k (a k ( Zj )), (4.10) 

j —1 k =1 k=l j=l 

where a±,... ,ai are complex numbers and l is any positive integer. 


4.2. Useful lemmas 

The aim of this subsection is to provide some useful lemmas. 

Lemma 4.1. Let z € CUnder assumptions (bl), (c), we have 

E\ lks \ 2 <Kn~ 4 , E\ lks \ 4 <K(\ + ^ + — Y (4.11) 

\n z p z np J 

E\r] k \ 2 < Kn~ 4 , E\ Vk \ 4 < K— + K ( -1 + 4 + — V (4.12) 

n \n z n z np J 
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Proof. From Lemma 5 in Pan and Zhou [25], we obtain 

E\slUs k - trH | 4 < K{EX^fE{ trHH ) 2 < KEitrM^M^f < Kn 2 p A , (4.13) 

where H = - diag(a^,..., al s 2) and afjj is the jth diagonal element of the matrix 

To get the third inequality in (4.13), by (4.6) and the uniform bound for ||D]7 1 ||, 
we obtain 


ItrM^M^I = |trD* s X£X*D fc s X£X fc | < 


n 


2(s—1) 




2„4 


- „2(s-l) 


— 1 \ m 2 , Kn Z p 


(4.14) 


+ v^P(I»-i + ^k)\\ < 


,.2s 


Let Ej(-) = E(-\X lk ,X 2k , • ■ -,Xj k ),j = 1 Since {X jk } k j=1 are independent of off, 

(X 2 k — 1 )aj S j = (Ej — Kj-i)(Xj k — l)a^. By Burkholder’s inequality and assumption (c) 


E 


33 

P 


1 = 1 


1 {S) 

l 33 


= E 


E( E.-E.-O^-l)^.? 


l=i 


< KEi^TE^nl^a 

Vi=i 


4|„( s )| 2 

'33 ' 


+ KY / E\Xn\ 8 E\a jj \ 4 (4.15) 
1=1 


< Kn 5 p 2 + n 3 p 3 


where we use the fact that, with wj being the j th row of X&, 


Fl|a^| 4 = £;|eJX fe D^ s X^| 4 


= /•: w ( 1). s w, 1 < v^EWwff < Kn 4 + Kn 2 p. 


Ti|8 


(4.16) 


Here for j = 1,... ,p, denotes the p-dimensional unit vector with the jth element being 
1 and all the remaining being zero. It follows from (4.13) and (4.15) that 


£|7 fe J 4 < -4~tE 
1,1 “ n 4 p 4 


1=1 


l 33 


K Fi|s^Hs fc -trH| 4 


n 4 p 4 


, 1 n 1 

< K\ — 4 —- H- 

k n z p z np 


Moreover, applying Lemma 8.10 in Bai and Silverstein [3], we have 


K 


E\p k \ < — —^E\s k Sfc — n\ + iL.E| 7 fci(z)| <K -f K — H—H- 


n 2 p 2 


E 

P 
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The bounds of the absolute second moments for 7 /- s , rjk follow from a direct application 
of Lemma 8.10 in Bai and Silverstein [3], (4.6) and the uniform bound for || 1 1|. □ 

When z£qU g r , the spectral norm of D~ x (z) as well as the quantities in (4.7) or 
Lemma 4.1 are unbounded. In order to prove Lemma 6.1, we will establish the bounds 
similar to those in (4.7) and in Lemma 4.1 for z6gUc r below. 

Let the event JJ n = {maxj< n |Aj(A)| > uq/2 + 1} and U nk = {maXj< n |Aj(Afc)| > 1 + 
uq/ 2}. The Cauchy interlacing theorem ensures that 


Ai(A) > Ai(A fc ) > A 2 (A) > A 2 (A fc ) > • > A„_i(A fc ) > A„(A). (4.17) 


Thus, U n k CU n . By (3.2) for any l > 0 

P(U nk )<P(U n )=o(n~*). (4.18) 


We claim that 


max{||D 1 (^)||, ||D 1 (z)\\,\fi k \} < ^n; 

m C n) 


|Aj(A) — z\ 

Wnk) 

\Xj(A k )-z\ 


<K 1 


<K. 


j = l,2,...,n, 

* = 1,2,..., (n. 


i); 


(4.19) 

(4.20) 


||D- 1 (z)||/(17^) < 2/(« 0 - 2), \\T>^(z)\\I(U^ k ) < 2/( Uo - 2). (4.21) 


Indeed, the quantities in (4.19) are bounded due to |l/S(;z)| < 1 n while (4.20) holds 
because I(Uf L )/\\j(A) — z | (or I(JJf l )/\\j(Ak) — z\) is bounded by vf 1 when z £ g u 
and bounded by 2/{uq — 2) when z £ q U g r . The estimates in (4.21) hold because of 
the eigenvalues of D _1 /(f/))) (or D^ 1 /({/°)) having the form I(Un)/(Xj(A) — z) (or 
I(Un)/ (Aj(A fc ) — z)). 


Lemma 4.2. Let z £ g n . The following bound 


\0 k \I(UZ)<K, 


(4.22) 


holds. 

Proof. In view of (4.2), to prove (4.22), we need to find an upper bound for ItrD” 1 — 
trD)7 1 |/(t/^) and a lower bound for |l + q^Dj) 2 qfc| KJJff). It follows from (4.20) and 


(4.17) that 


ItrD" 1 - trD- 1 1 1{U c n )< 


n—1 

E 


y _— 

Aj(A) — z ^Aj(A k )-z 


W C n) 
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Wn) 

(4.23) 



< K( Ai(A) - A„(A) + 1 )/(J7=) < A > 0 + 3). 

Let Uj(Afc),j = — 1 be the eigenvectors corresponding to the eigenvalues 

Aj(Afc), j = 1,. .. ,n — 1. Then 5^j=i ^a") 7 — ^ s P ectra l decomposition of D)( 2 - 

We distinguish two cases: 

(i) When z G V\ = g u U {z : |S(z)| > (ito — 2)/4}, via (4.7), we then obtain 

\Pk\I{U°) < l/| 3 (z)l < maxjv-j - 1 , 4 /(u 0 — 2 )} < A. 

Thus, (4.22) is true for z G V\. 

(ii) When z € V 2 = (q U? r )n {2 : |S(z)| < (uo — 2)/4}, if U£ happens, we have |Aj(A fc ) — 
5ft(z)| > since Sft(z) = ±uq for z eV- 2 - A direct calculation shows 


< ( y' Aj(A) - Aj(Afc) _ 1 \ 

“ Vt^I ~ ~ Z \ | An ( A ) -Z\) 


5ft((l + qi , D- 2 q fe )/(^)) = l + ^ 

i=i 


(A J (A fc )-jR(z)) 2 -|S(z )| 2 

|Aj(A fe ) — 2 r | 4 


(q*u i (A fc )) 2 /(c^)>i. 


Therefore, |1 + q J D fe 2 q|/(LT) has a lower bound which, together with (4.23), 
implies (4.22) is true for zGV^. 

Since g n = V) U V 2 , we finish the proof of Lemma 4.2. □ 


Lemma 4.3. Let 2 G <; n and p k = ^f( s fc s fc - p) - qfcD fc 1 (z)q fc + trXD 1 (^)X T . 
The following bounds hold 


and 


5 A 

E\fL k \ 4 <K— + K 
n 



n 




\ E Pk\=o{n x ). 


(4.24) 


(4.25) 


Proof. Write 


Pk 



(sls k -p) 


Tki + 




trD” 1 ^) 


-tr T> k (z) 


-( 1+ "\/f)(» trD "' w_ ' E «‘ rD ’ lw ) 

= Li — 7fei + L3 + L4 + L5 ■ 
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When the event U% happens, reviewing the proof of the second result of (4.11) and via 
(4.21), we also have 

E\ lks \*I{U c n )<K(\ + ^ + —), m = l,2. 

\ n z p z np J 


Moreover, by (4.18) and (4.19) 


It follows that 


E\ lks \ 4 I{U n ) = o{n~ t ). 


1 n 1 


E\ lks \ 4 <K(- + -^ + —), to = 1,2. 
nr p np j 


(4.26) 


Using Lemma 8.10 in Bai and Silverstein [3], (4.18), (4.19) and (4.23) we then have 

E\Li\ 4 ‘ < K5 4 n~ 4 , E\L 3 \ 4 <Kn ~ 4 , E\L 5 \ 4 < Kn~ 2 p~ 2 . (4.27) 

As for L 4 , by Burkholder’s inequality, (4.3) and (4.23), we have 


E\L 4 \ 4 < Kn~ 4 E 


^^(Ek — E k -i)(ti D -1 — tr D^T 1 ) 

k=1 

/ n \ 2 

<Kn- 4 Y / E\^'D~ 1 {z)-ti'Df 1 {z)\ 4 + Kn- 1 E{J2Ek\trB- 1 (z)-ti 1 Df 1 (z)\ 2 

fc=1 \fc=:i / 

n 

<Jfn- 4 X)^|trD- 1 (*)-trDfc 1 (z)| 4 7(t7“) (4.28) 


k=1 


+ Kn- 4 E\^TE k \trD- 1 (z)-tr-Df 1 (z)\ 2 j I(U^+o(n~ e ) 

< Kn~ 2 . 

Therefore, the proof of (4.24) is completed. Also, the analysis above yields 


E\L 1 -'y kl \ 4 <K(- + -^ + ^ + — ) <KS 
\n n z p z np J 

It is also easy to verify that, for z€? n , 

1 


E\L 3 + L 4 + L 5 \ 4 <Kn~ 2 . (4.29) 


E 


y/np 


K Sfc - p) 


<Kn \ E\r :km \ z < Kn \ 


(4.30) 
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We proceed to prove (4.25). First of all 


\EL\\ = 


( np ) 3 / 2 


\i=i ) 


(4-31) 

3 *') j =l 


For s = 1,2, denoting = (a^) pX p, we then have 


-®7fcs = -^“3-E 


('Ex ik X jk a$+'£ / {Xf k 


- l)a. 


0 ) 


= Ji + J2 + J3 + J4, 


i= 1 


where 


* = E ^ 

* = 3 ^ E (E X “< X « - - '(“SMfd?) • 




2 0) (s) (s) 

a ij a ii a ji 


The inequality (4.16) can be extended to the range z £ <; n by a similar method as that 
in (4.26). Therefore, 


1 


IJ 2 I < K^-^rpS 2 y/np(n 4 + n 2 p) 3 ^ 4 < KS 2 n \ 

IJ 3 I < K-^-p 2 E\\wi\\ 3 E\\wj\\ 3 + o(n~ e ) < Kp^ 1 + o{n~ e ), J 4 < Kp _1 +o(n _f ), 

n°p° 

where wj is the jth row of X&. 

Consider J\ now. We first note that J 12 = 0(p _1 ). Split J 12 as 


J12 = 


n J p 




n°p° 


(a) (s) (a) 

<4 <4 % 


iz£t 


, 1 _(«) _(«)_(«) , 1 _(«)_(«)_(») , 1 pVliWl 3 

+ r3T3 i Z^ a b a i3 a H + Z^ h Z^ a iJ a P a » + ) 


n°p’ 




n°p' 






< Kn 2 + Kp 1 . 
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Thus, we obtain 

^UKK^n^+p- 1 ). (4.32) 

It follows from (4.29), (4.30) and (4.32) that 

\Efi\\ < \E(L\ — 7 fci) 3 | + \E{L$ + Li + L§) 3 \ + 3|£’(Ti — 7 m)(L 3 + 4 + L 5) 2 1 
+ 3|-E(Li — 7fei) 2 (L3 + 4 + £ 5)1 

< \EL\\ + |4y£ 3 | + 3 E 1/2 EL\ ■ E 1/2 jZ s + 3 E 1/2 Ll ■ £ 1/2 7 t + Kn~ 3/2 + KSrT 1 

= o(n -1 ). 

The proof of Lemma 4.3 is completed. □ 

The following lemma will be used to prove the first result of (3.5) and (6.15) below. 
Lemma 4.4. For we have 


E\MP(z)\<K , 


where Mn\z) = n{m n {z) 


Em n (z)). 


Proof. Note that the expression Mn\z) in (4.3) may not be suitable for z £ since 
/ 3ff or even /3j, r /([/°) may be not bounded. For this reason, we introduce the following 
notations with the purpose to obtain a similar expression to (4.3). Let 


4 = 


1 

2 + (l/(np)).EtrMj4 


4 = 



(s^s fc -p)-7 fc i 


— trM 

np 


(i) 

k 


—E trM 
np 



Hence 


4 — 4 + 444- 


(4.33) 


As in (4.3) and a few lines below it, by (4.33), we write 


M^\z) — y^(£'fc — 4-i)(4i + 4 2 + <fe3 + 4) ; 

k= 1 


where 


4i (- 2 ) 


1 + 


— trMj; 2 )N )(4) 2 4, 

np / 


42 ( 2 ) =-7fci(4) 2 4, 


43(2) 


1 + — q ^ D fc 2 (-) q ^ 


4(4) 2 /ife 


4 = 47 * 2 ( 2 )- 
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We next derive the bounds for e k and the forth moment of fi k . Since F n a ' s > F as n —> oo, 
we conclude from (4.18), (4.19), (4.21) and the dominated convergence theorem that, for 
any fixed positive integer t 

, t 


E\m n {z) — m(z)\ —> 0. 
By (4.6), (4.23) and (4.34), we then have 


E— trMi 1} = E 
np 


Hence, 


1 + z 


i(z) — ( 1 + 


= 


1 

< 

2 

z + m(z) + o(l) 


z + m(z) 


n\ 1 , „ , _i, n — 1 

— (tr D — tr D fc ) 4 -— 

p J n v/hp 


< 2 . 


(4.34) 


—> m(z). 


(4.35) 


On the other hand, via (4.6), (4.23) and (4.28) 


E 


1 


— trMf 1 -E— trivd 15 
np np 


< 1 + z 


n 4 i?|trD l —E trD 1 \ 4 <Kn 


and this, together with (4.26), implies 

E\p k \ 4 <K—+K 

n 


1 n 1 

— + — 4 - 

n z np 


-2 


(4.36) 


Combining (4.35), (4.36), Lemma 4.2, (4.18), (4.19), (4.21) with Burkholder’s inequality, 
we obtain 


E\M^(z)\ 2 <K. 

The proof of the lemma is completed. 


□ 


5. Convergence of M^\z) 

To prove Proposition 3.1, we need to establish (i) the finite-dimensional convergence and 
the tightness of Mn\z)\ (ii) the convergence of the mean function EM(z). This section 
is devoted to the first target. Throughout this section, we assume that zeCi and K 
denotes a constant which may change from line to line and may depend on v\ but is 
independent of n. 

5.1. Application of central limit theorem for martingales 

In order to establish the central limit theorem for the martingale (4.10), we have to check 
the following two conditions: 
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Condition 5.1 (Lyapunov condition). For some a> 2, 




k=l 


E k 'y ' a :j Ip;; (o’k ( Zj ) ) 


A0. 


Condition 5.2. The covariance 

n 

A n (zi,z 2 ) = y^jg fc _i[-E fc Q fc ( 2 :i) • E k a k (z 2 )\ 


(5.1) 


fc=1 


converges in probability to A(zi,z 2 ) whose explicit form will be given in (5.29). 


Condition 5.1 is satisfied by choosing a = 4, using Lemma 4.1, and the fact that via 
(4.7) 


KOOI = 


np 


1 + —trMi 2) )(/3nV + ^ r 7fe 


< 


l+v l 


\Vk\ + ~hk\- 


Consider Condition 5.2 now. Note that 


<*k(z) = ~ (i + E trMf) 'Ik 0k = §~Mvk)- 


By the dominated convergence theorem, we have 

d 2 


A n (zi,z 2 ) = Q ^2 E k-i[ E k(0k(zi)Vk(zi)) ■ E k (/3l r {z 2 )p k (z 2 ))}. (5.2) 


fc=i 


By (4.6), (4.2), (4.8), (4.1) and the fact m n (z) -^4 m(z), and the dominated convergence 
theorem again, for any fixed t, 


E 


— trM! 1 ' — m(z) 
np k 


—»0, E\(5 k {z) + m(z)\ —> 0, asn—>oo. (5.3) 

Substituting (5.3) into (5.2) yields 


A n (zi,z 2 ) = 


d 2 


dz 2 dzi 

d 2 

dz 2 dz± 


m(zi)m(z 2 ) E k -i(E k r] k (zi) ■ E k y k (z 2 )) +Oi. p .(l) 
k =l 

m(zi)m(z 2 )A n (zi, z 2 ) + Oi. p .(l)\. 


(5.4) 
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By Vitali’s theorem (see Titchmarsli [28], page 168), it is enough to find the limit of 
A n ( 21 , 22 )- To this end, with notation ^(M^, 1 ^^)) = ( aij(z)) nxn , write 


XikXjkdij (z) + '£(X? k -l)a ii (z)]. 

V P j =1 P \i^j 


np 


i= 1 


By the above formula and independence between {X, : / C }( J =1 and -Efc(M^), a straightfor¬ 
ward calculation yields 


Ek-i[EkT]k(zi) • EkVk(z2)\ — — E(X 11 — 1) + Ai + A 2 + A 3 + A 4 , (5-5) 

n 


where 


Ai =- ^=E{Xl - 1 fTauizJ, A 2 =- ^—=E{Xl 1 - if V fl „( 22 ), 

npyjrvp np^/np z —' 


p 


2=1 


2 p 1 2 

A-3 = -^-9 y^Qij(^i)Qjj(-g 2 ), A 4 = E(xf 1 - 1) 'Y' j au(z 1 )a i i(z2). 

n p ijtj n p *= 1 

Note that au{z) is precisely E^a^f in (4.16). From (4.16), we then obtain for j = 1,2,4 


v 


E 


Y. A > 

k =1 


-> 0 . 


Also, we conclude from (4.16) that 


E ^ 3 


k=l 


2 

n 


E Zfe 

k= 1 


2 

n 2 p 2 


n p 

EE a»tCzi)a*i(z2) 

fc=l 2=1 


2 > \ 

— + °Li (1); 

n z —' 


where 

Z fc = -EtrBfcM^^i) • E k Mf\z 2 ). 
Summarizing the above we see that 


E n (z-L,z 2 ) 


2 x ^ 

— 53 Zfc + 1^4 — 1 + Ol 1 ( 1 ). 

n z —' 

k= 1 


(5.6) 


5.2. The asymptotic expression of Z& 

The goal is to derive an asymptotic expression of with the purpose of obtaining the 
limit of K n {zi, z 2 ). 










CLT for normalized, sample covariance matrices 


31 


5.2.1. Decomposition of Zj, 

To evaluate Z&, we need two different decompositions of {z). With slight abuse 

of notation, let {e^, i = 1 ,..., k — 1 , k + 1, ..., n} be the (n — l)-dimensional unit vectors 
with the zth (or (i — l)th) element equal to 1 and the remaining equal to 0 according as 
i <k (or i> k). Write + s,ef. Define 




y/rvp 


( X fa: X fc -pl(i)) - zln-l 


Dfci = D fe - e, : h' - r, ; e/ = 


y/np 


( X fei X fci Pl(i)) Zl n -1, 


A /W 


v'T C . 
-^■ki° z 5 


rrt 1. rrt X . r T 1 . m 

^ = ^ X ^ + ^ (S * Si - p)e - 

0 = YTA’ * = 


(5.7) 


Here I(j) is obtained from I„_i with the zth (or (z — l)th) diagonal element replaced by 
zero if * < k (or i > k). With respect to the above notations we would point out that, 
for i < k (or i > k), the matrix X^,; is obtained from X*, with the entries on the zth 
(or (z — l)th) column replaced by zero; hj is the zth (or (z — l)th) row of A*, and r; 
is the zth (or (z — l)th) column of A*, with the zth (or (z — l)th) element replaced by 
zero. (XjTjXfc — plu)) is obtained from (X^Xj, — pl„_i) with the entries on the zth (or 
(z — l)th) row and zth (or (z — l)th) column replaced by zero. 

The notation defined above may depend on k. When we obtain bounds or limits for 
them such as -f tr D^ 1 the results hold uniformly in k. 

Observing the structure of the matrices X^ and Dj” 1 , we have some crucial identities, 


X/^e^ — 0, 


- T D 1 — e T D -1 — — z~ 1 ( 
L-’ki ^ z 1 


ki,r 


(5.8) 


where 0 is a p-dimensional vector with all the elements equal to 0. By (5.8) and the 
frequently used formulas 


we have 


Y” 1 - W 1 = — W -1 (Y - W)Y , 
(Y + ab T ) _1 a = 


b J (Y + ab T ) _1 = 


l + b T Y- 1 a’ 
b T Y” 1 
l + b T Y- 1 a’ 


Dr 1 - Dr 1 = -ODr 1 edi' D ,: 1 , 

k kx.r ki.r 1 i Aii.r’ 


Zyfrvp 


D^/X^ef. 


(5.9) 


® ki,r ^ki 


(5.10) 
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We first claim the following decomposition of EkM.^\z), for i<k , 

\z ) = iHfcMfcj — J + E k f —] 

\~™P / \2>/^ / 

+ Sjsf E k 


S,;S, 


Jp. I Ci t\ t \ 77 .. / Ci \„ „T 


__M fci -£* — SjS; 

y v - y 

= l?i(;z) + - 82 ( 2 ) + (z) + B±{z) + B§(z). 

Indeed, by the decomposition of Xfc, write 

M« = X* i D* 1 X£ +X fci D^ 1 e i sf + s i efD* 1 X& + s,ef D^sf. 

Applying (5.7), (5.8) and (5.10), we obtain 

X fci D^X£ = X fci Dj^.X& - CiX^D^e.hfD^Xi; 

= Mh XfejDj.^X^) 


Zy/np 

Ci 


y/rvp 


= M fci -— M ki SisfM ki . 

znp 


Similarly, 


X fcl D fc 1 e i sf = — -C=M ki SisT 


Zy/np 




s i e f D^'X^j = 


s e T D _1 e s T — Ose T D _1 e s T — ——s s T 

— ■ L 'fci,r e i b * — z h * h i • 

Summarizing the above and noting E k (si) = s t for i < k yield (5.11), as claimed. 
On the other hand, write 


Zy/rvp 


:-SisfMki , 


D/ = ]T e ihf — zln-i- 
*=i(#fe) 

Multiplying by D,) 1 on both sides, we have 

n 

zD fc = — I«-i + e ihi D fe . 

i=i(jtk) 

Therefore, by (5.8), (5.10) and the fact that X^X^ = we have 

n 

zE k (M.£\z)) =-E k (X k X%) + Y, ^-i(X,e, hfD^X£) 

»= !(#*) 


(5.11) 


(5.12) 
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( n \ n 

£ s i 8 n+ £ E k (C zSl hjB^ r (Xli + e lS f)) 

i=l(^k) ) i=l(^Lk) 

(5.13) 

= -(n-k) I„-i - £ Sjsf + £ E k (-^SisfM ki j 

i<k i=l(^k) W n P J 


+ £ EhfadiSisl). 

i=l&k) 


Consequently, by splitting E k (M.^\z 2 )) as in (5.11) for i < k and z\E k (Js/l2\ z i)) as in 
(5.13), we obtain 


Zl Z k = tr^M^^) • E k M[}\z 2 ) 

= C 1 (z 1 ,z 2 ) +C 2 (z 1 ,z 2 ) +C 3 (zi,z 2 ) +C 4 (z 1 ,z 2 ), 


(5.14) 


where 


Ci(zi,z 2 ) = -^(n- /c)tr£ , fe M[ 1) ( 2 2 ), 

np A 


1 


C 2 (Zl,Z 2 ) = S T £ Si = £ C ' 2 T- 


np 


i<k \j =1 


C 3 (z 1 ,z 2 ) = -^Y / E k 

nn z z ' 


np* 


i<k 


Q{zi) 

y/W 


3=1 
/ 5 


sfMfci(zi) y ^Bj(z 2 ) Si 


<3=1 




np 


i>k 


M fcl (zO^fcM^ (z 2 )s. 


= £^, 

3=1 


Ci(zi,z 2 ) = —- y^E k 

n/n z z —' 


np* 


i<k 


Ci(zi)di{zi)sJ (j2 B j{z 2 ) 


<3=1 


+ ^T, E ^ z M^fE k M ^\z 2 )s i ]=Y t C Aj , 

P i>k j=l 


where C 2j corresponds to B :j ,j = 1,..., 5, for example, C 2 i = -yjy )C,:<fc s f (-Bi(~ 2 ))s,;, 
and C3J and C^j are similarly defined. Here both 63(24, z 2 ) and 64(24, z 2 ) are broken up 
into two parts in terms of i > k or i < k. As will be seen, the terms in (5.14) tend to 0 in 
L 1, except C 2 5, C34, C45. Next let us demonstrate the details. 
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5.2.2. Conclusion of the asymptotic expansion of hk 

The purpose is to analyze each term in Cj(zi,Z 2 ),j = 1,2,3,4. We first claim the limits 
of which appear in Cj(z\,Z 2 ) for j = 2 ,3,4: 

i9i ~^m(z)/z, Ci(z') ——> — zm(z'), asn— >oo. (5.15) 


Indeed, by (5.8) and (5.10), we have 

1 


= — sTM ki Si - 


1 


:(sfs i-p). 


znp " Zy/np 

Replacing in 7 km{z) by M^, by a proof similar to that of (4.11), we have 

E 


(5.16) 


1 T 

1 

4 

/ 1 

1 \ 

—Si MfciSi - 

— tr M fc i 

< R 

“^7 H - 

np 

np 


v n^ 

np j 


(5.17) 


By (4.6), we then have — -A trD ^ 1 0. To investigate the distance between trD ^ 1 
and trD^ , let A ki be the matrix constructed from A*, by deleting its ith (or (i — l)th) 
row and ith (or (* — l)th) column and write Tiki — Dfci(z) = A ki — zl n -2 if i < k (or 
i > k). We observe that D )" 1 can be obtained from D ^- 1 by deleting the ith (or (i — l)th) 
row and ith (or (i — l)th) column if i < k (or i > k). Then trD^ — trDj ). 1 = — fo By 
an identity similar to (4.2) and an inequality similar to the bound (4.8), we also have 
| trD ^ 1 — trDfcj| < 1/iq. Hence | trDjT 1 — ti’D^- 1 ! < (l/iq + l/\z\). From (4.2), we have 

| trD^T 1 — tr D| < l/v\ as well. As ^ trD -1 m(z) for any fixed t by the Helly-Bray 
theorem and the dominated convergence theorem, we obtain the first conclusion of (5.15). 

Since the imaginary part of (^Ci) -1 is (3(z) + ^9(sf MfcjSj)) whose absolute value is 
greater than v\, we have |£i| < \z\/v\. Consequently, via (4.1), we complete the proof of 
the second consequence of (5.15), as claimed. 

Consider Ci(z\,Z 2 ) first. By (4.6), 


E\C 1 (z 1 ,z 2 )\=E 


- 5 - {n - k) tr EkM^ (z 2 ) 


n 2 p 


<JL n ^ p = K l^ 0 . 

np z p 


(5.18) 


Before proceeding, we introduce the inequalities for further simplification in the follow¬ 
ing. By Lemma 8.10 in Bai and Silverstein [3] and (4.6), for any matrix B independent 
of Si, 


£|sfM fcl Bsi | 2 < K{E |sfM fcl Bs, ; - trM fei B | 2 + KE\ trM fet B| 2 ) < Kp 2 n 2 E ||B|| 2 , 

(5.19) 
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where we also use the fact that, via (4.6), 

ItrM^BBM^I = |trD- 1/2 X^BBX fci D“/x^X fci D- 1/2 | 
<n||D^ 1/2 X|;|| 2 .||B|| 2 .||X fc . i D- 1 X^|| 

= n-||B|| 2 .||D- 1 X£X fel || 2 
= n - 1|B|| 2 • IbD" 1 + +zD- 1 )|| 2 

<Knp 2 ||B|| 2 . 


For i > fc, since EffM-k is independent of s,;, we similarly have 

E\sfE k M k B Si \ 2 <Kn 2 p 2 . (5.20) 

Applying Cauchy-Schwarz’s inequality, (5.19) with B = I„_i and the fact that |Ci| is 
bounded by \z\fv\ we have 

E\C 2 j \<K^, j = 1,2,3,4. (5.21) 

Using (5.19) with B = EkM-kiiz^) or B = EkM-k hr (5.19), we also have 


E\C 3 j \<K^ j = 1,2,3,4. (5.22) 

By (5.20), (5.15) and (5.19) with B = I„_i, we obtain 

71 

E\C Aj \<K~, j = 1,2,3,4,6. (5.23) 


Consider C32 now. Define Q and M*,*, the analogues of (i{z) and M^b) respectively, 
by (si,... ,Sfc,s fc+ i,... ,s„) T , where s fc+ i,...,s n are i.i.d. copies of s fc+ i,...,s„ and in¬ 
dependent of si,...,s n . Then Q. Mj,, have the same properties as (i(z), Mfcj(z), respec¬ 
tively. Therefore, |^| < \z\/v\ and ||Mfei|| < Kp. Applying (5.19) with B = Mfci(^i), we 
have 


e\c 32 \ = e 




np ■ 


T Mki(z 1 )^^Mki{z2)sisfMki(z 2 )s., 


i<k 


z 2 np 


K 


< 2 3 


n 2 p 3 y/np 


Y.E^lsjMk^M^Sif ■ E^sjMk^Sif (5.24) 


i<k 



< K 
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Third, consider C 25 . In view of (5.15), it is straightforward to check that 


C 25 =- m(z 2 ) + o Ll (l). 


(5.25) 


Further, consider C 34 . By (5.15) and (5.19), we have 


C34 = — 2 E k 

nr~ z ' 


np- 


i<k 


= -iE £ ‘ 


np- 


i<k 


Ci{z 1) 

ynp 

Ci(^i) 


sfM fci (2i)B 4 (z 2 )s l 


sfM fci (zi)F; fe 


0 (^2) 

zisjnp 


M ki (z 2 ) \sisjsi 


= zim(z 1 )m(z 2 )^ r ^y' sfE k M ki (z 1 )-E k M ki (z 2 )s i +o Ll (l) (5.26) 




i<k 


= zim(z 1 )m(z 2 )—^y^tT(E k M ki (z 1 ) ■ E k M ki (z 2 )) + o Ll (l) 




i<k 


= z\m(z\)m{z 2 )—h k + ol x ( 1 ), 
n 

where the last step uses the fact that via (5.11), (5.19), (5.8) and a tedious but elementary 
calculation 

^\tv(E k M ki ( Zl ) ■ E k M kl {z 2 )) - trE k (X k Dp( Zl )Xl) ■ E k (X k Bp (z 2 )X^)| < 
Consider C 45 finally. By (5.15), we have 


C45 = —m 2 (zi)m(z 2 )—K Oij ( 1 ). 

n 


(5.27) 


We conclude from (5.14), (5.18), (5.21)-(5.27) and the fact m 2 (z) + zm(z) + 1 = 0 that 

z\Z k = — —m(z 2 ) - -m 2 (zi)m{z 2 ) + - zim{z 1 )m{z 2 )Z k +o Ll (l) 
n n n 

k k 

= -zim{zi)m(z 2 ) + -Zim(zi)m(z 2 )Z k + ol 1 (1), 
n n 

which is equivalent to 


{k/n)m{z\)m{z 2 ) , ^ 

k 1 — (k / n)m{z\)m{z 2 ) Ll 


(5.28) 
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5.3. Proof of Condition 5.1 

The equality (5.28) ensures that 
1 


n 2 p 2 


^tr^M^) • E k M^\z 2 ) = - ^Z fc 


/c=l 


k=l 


[ tm{~i)m{~ 2 ) dt = -1 - ( m ( Zl )m( Z2 )) 1 log (1 - m(z 1 )m(z 2 )). 

J o 1 - tm{z 1 )m(z 2 ) 

Thus, via (5.6), we obtain 

A„(zi, z 2 ) -^4 v 4 - 3 - 2(m(z 1 )m(z 2 ))~ 1 log (1 - m(z 1 )m(z 2 )). 
Consequently, by (5.4) 


d 2 

A(zi, 2 2 ) = w —\(yi ~ ?>)m(zi)m(z 2 ) - 21 og(l - m{z 1 )m{z 2 ))\ 

OZ\ OZ2 

= m!{z\)m'(z 2 )[vi - 3 + 2(1 - m(zi)m(z 2 )y 2 ]. 


(5.29) 


5.4. Tightness of M^\z) 

This section is to prove the tightness of Mn\z) for z GC±. By (4.7) and Lemma 4.1, 


E 


EE ajE k -i{oik{zj)) 

fc= 1 J=i 


<^EE \°j\ 2 E \M z j)\ 2 < K , 

fe=ij=i 


which ensures condition (i) of Theorem 12.3 of Billingsley [ 8 ]. Condition (ii) of Theo¬ 
rem 12.3 of Billingsley [ 8 ] will be verified by proving 


E\m£\ Z i ) - m£\z 2 )\ 2 


<K, 21,22 e Ci. 


\zi - 22 r 

We employ the same notations as those in Section 4.1. Let 

T fcl = ^sI’X,D^ 1 ( 2 l )(D^ 1 (z 1 ) + D^ 1 (z 2 ))D A : 1 ( 2 2 )Xrs fe 


(5.30) 
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dki(z) = Pk{z)( 1 + —slMf\z)s k \ 

V np ) 

d k2 (z) = H-trM[ 2 ) (z), 

np 

d k3 = 1 + ^tvX k Tt^ 1 (z 2 )B^ 1 {z 1 )X.l, 
d k4 = -jL tr X.D - 1 (z^D " 1 (zi) + D-\z 2 ))-D-\z 2 )Xl 
As in (4.3), we write 

MW( Zi )-MW(z2) 

n 

= - TXE k - E k -i)(dki(zi) - dki(z 2 )) 

k =1 

n 

= -{zi - z 2 )YXE k - J5fc_i)[^ fc (^i)(Tfci +dfc 4 ) -/ 3 fc(j 2 i)dfci(^ 2 )(Tfc 2 + 4 3 )] 
fc=i 
n 

= -(zi - Z 2 ) ^2( E k - -Efc-l) 
fc=l 

X [(Zl + Z 2 ) + Z 3 - Pk{zi)pk{z 2 )d k 2dk3 - ^(^i)/ 3 fc(^ 2 )cZfc 37 fc(- 22 )] 

n 

= —(zi — z 2 ) ^"^(A’fc — £’fe_i)(Zi + Z 2 + Z 3 + Z 4 + Z 5 + Ze), 

fc=i 

where 

Zi = /3fc(zi)Tfci, Z 2 = /3fc(zi)/3fc r (zi)r?fe(zi)cZfe 4 , 

Z 3 = — ^fe(zi)Tfe 2 cZfci(zi), Z 4 = ~/3 k {zi)/3 t k I (z 1 )ri k (zi)l3k(z 2 )dk2{z 2 )dk3, 

k = ~l3 k {zi)P k (z 2 )Pl r (z 2 )^ k (z 2 )d k2 (z 2 )d k 3, Z 6 = —/3fe(zi)/3fc(z 2 )cZfc37fe(z 2 ). 

Here the last step uses (4.5) for /3 k (zi) and the facts that 

D^ 2 (Z!) - D)T 2 (z 2 ) = (zi - z 2 )D / 7 1 (z 1 )(D / 7 1 (z 1 ) + D fc - 1 (z 2 ))D^ 1 (z 2 ), 

/3fc(zi) - Pk(z 2 ) = (z 2 - z\)p k (zi)fik(z 2 )Tk 2 + {z 2 - Zi)/3 k (zi)/3 k (z 2 )dk3, 

(E k — E k _\)f3 k (z\)d k b = 0, (E k — E k _ 1 )^(z 1 )^(z 2 )d k2 (z 2 )d k3 = 0. 

By (4.6) and Lemma 8.10 in Bai and Silverstein [3], without any tedious calculations, 
one may verify that 


\d k j{z)\ < K, j = 1,2,3,4, 


and E\T k j \ 2 <Kp 1 


3 = 1,2. 
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The above inequalities, together with Burkholder’s inequality, imply (5.30). 

6. Uniform convergence of EM n (z ) 

To finish the proof of Proposition 3.1, it remains to derive an asymptotic expansion 
of n(Em n (z) — m(z)) for z£Ci (defined in Section 3.2). In order to unify the proof of 
Theorem 1.1 and Corollary 1.1, we derive the asymptotic expansion of n(Em n (z ) — m(z)) 
under both assumptions n/p—> 0 and n 3 /p = 0(1) in Proposition 6.1. For the purpose 
of proving (3.6), we will prove a stronger result in Proposition 6.1, namely uniform 
convergence of n(Em n (z) — rn{z)) for 2 £ s n = (J i=j r u ft- F° r z located in the wider range 
S n , the bounds or limits in Section 3 (e.g., Lemma 4.1, (5.3), (5.15)), cannot be applied 
directly. Hence in Section 4, we re-establish these and other useful results. Throughout 
this section, we assume 2 £ s n and use the same notations as those in Section 3. 

Proposition 6 . 1 . Suppose that assumption (c) is satisfied. 

(i) Under assumption (bl): n/p—> 0, we have the following asymptotic expansion 


n[Em n (z) - m(z) - X n (m(z))\ = o(l) 


( 6 . 1 ) 


uniformly for z € <; n = {J i=l r u Si, where X n (m) is defined in (l.f). 

(ii) Under assumption (b2): n 3 /p= 0(1), we have the following asymptotic expansion 



( 6 . 2 ) 


m 3 (z)(m'(z) + 1/4 — 2)(1 + m'(z)) + o(l), 


uniformly for z&s n = Ui=;, r ,u ft ■ 


This, together with (5.29) and the tightness of Mn\z) in Section 5.4, implies Propo¬ 
sition 3.1. It remains to prove Proposition 6.1. To facilitate statements, let 




1 


Here, uj n ,e n all depend on 2 and n, and e n are non-random. 


Lemma 6 . 1 . Letz€s n ■ We have 


nEiv n = m 3 (z)(m'(z) + ^4 — 2) + o(l). 
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Assuming that Lemma 6.1 is true for the moment, whose proof is given in Section 6.1 
below, let us demonstrate how to get Proposition 6.1. By (3.8) in Bai [4], we obtain 

1 1 . n 

m n {z) = -trD^z) =- V'/3 fc . (6.3) 

n n z ' 

k =1 

Applying (4.1), (6.3), (4.6) and taking the difference between 8k and we have 

1 " 

Em n (z) - m(z) = - Epk + 

n ^' 


l 


fc=i 


z + m(z) 


= E-Y^mjz) 

n a —^ 


k= 1 


Hk - ( Em n (z) - m(z)) - \/-( 1 + zEm n (z)) 


(6.4) 


= Eu> n + m(z)Em n (z)(Em n (z) — mlz)) + , / —m(z)Em n (z)( 1 + zEm n (z)). 

V P 

Under assumption nfp^r 0: Let Em n ,m respectively, denote Em n (z),m(z ) to simplify 
the notations below. By (4.1) and (6.4), we have 

Em n — m = Eiij n + m 2 (Em n — m) + m(Em n — m) 2 + . —m(Em n — m)( 1 + zm) 


+ . —m 2 { 1 + zm) + . —zm(Em n — m ) 2 + , — zm 2 (Em n — m ) 


= A(Em n — m) 2 + (B + 1 )(Em n — m) + C n , 
where A, B are defined in (1.4) and 


C n — Euj n 



Rearranging the above equation, we observe that ( Em n — m) satisfies the equation Ax 2 + 
Bx + C n = 0. Solving the equation, we obtain 


- B + VB 2 - 4 AC n -B - VB 2 - 4AC„ 

X(1) ~ 2 A ’ X(2) ~ 2 A ’ 

where \JB 2 — 4 AC n is a complex number whose imaginary part has the same sign as that 
of B. By the assumption n/p —>• 0 and Lemma 6.1, we have 4 AC n —> 0. Then x/^ = o(l) 
and X( 2 ) = ^ o(l). Since Em n — m = o(l) by (6.5), we choose Em n — m = £(i)- 

Applying Lemma 6.1 and the definition of X n (m) in (1.4), we have 


n[Em n (z) - m(z) - X n {m{z))\ 


—4 A[nEaj n — m 3 (z)(m'(z) + iq — 2)] 
2A{VB t ~U4Jc^ + VB 2 - 4 AC) 















CLT for normalized sample covariance matrices 


41 


-> 0 . 


Hence Proposition 6.1 (i) is proved. 

Under assumption n 3 /p = 0(1): subtracting m(z)Em n (z)(Em n (z) — m(z)) on the both 
sides of (6.4) and then dividing 1(1 — m(z)Em n (z)), we have 


n(Em„.(z) — m(z)) = 


nEtOr, 


™ 3 {z) , 



1 — m 2 (z) 


2 / \ 

where we use (4.34), Lemma 6.1, (4.1) and the fact that m!{z) = . Proposi¬ 

tion 6.1 (ii) is proved. Hence, the proof of Proposition 6.1 is completed. Now it remains 
to prove Lemma 6.1. 


6.1. Proof of Lemma 6.1 

From the definitions of 8 k , S n and p, k (see Lemma 4.3), we obtain 

8k=£n+8k£nfik- (6.5) 

By (6.5), we further write 8k as 8k = S n + s^pk + £n8 k + Pk^nPl, which ensures that 


nEuj n = m{z)e n ^ E(p k ) + m(z)e 1 n ^ E(p 2 k ) 

k =1 k =1 

n n 

+ rn(z)e 3 n ^ E(p 3 k ) + m(z)4 E (Pk8k) 


( 6 . 6 ) 


k= 1 


fc=l 


= Hi + H 2 + H 3 + H 4 , 


where H 3 . j = 1,2,3,4 are defined in the obvious way. As will be seen, H 3 and H 4 are 
both negligible and the contribution to the limit of nEu} n comes from Hi and H 2 . Now, 
we analyze Hj,j = 1,..., 4 one by one. 

Consider H 4 first. It follows from (4.1) and (4.34) that 


E-n — 


z + m(z) +o(l) 
By Lemma 4.2 and Lemma 4.3, 


= —m(z) + o(l). 


<5 4 


(6.7) 


E\8kPl\ < KE\pi\I{U c n ) + E\8kfit\I(U n ) < K - + -= + o (n~ L ) < KS 


n p* 


— 1 
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which, together with (6.7), further implies 

#4 = 0 ( 1 ). 

It follows from Lemma 4.3 and (6.7) that 

#3 = 0(1). 

Consider Hi next. We have, via (4.6) and (4.2), 


#i =m(z)e n V( E— trXD- 1 X r -£— trM 
' np np 


( 6 . 8 ) 


(6.9) 


= 1 + z 


fc=l 
n 
P 


= - 1 + 2 


(z)e n ^ E(tr D 1 - tr D fc x ) + s j^m{z)e n (6.10) 

^m{z)£ n ^Y j E + +^m(z)e n . 

Applying (4.23), (4.26) and (4.34), it is easy to see 

1 + = 1 + (^~ tr Mjr 1 ^ + o L 4 (l) = l + m'(z)+o i4 (l). 

This, together with (6.7), Lemma 4.2 and (6.3), ensures that 

Hi = —m 2 (z)( 1 + m' (z))Em n (z) + o(l) —> —m 3 (z)( 1 + m'(z)). ( 6 - 11 ) 

Consider #2 now. By the previous estimation of Ejl k included in Hi we obtain 

Efll=E{fl k -Efl k ) 2 + 0{n- 2 ). (6.12) 

Furthermore a direct calculation yields 

E(fL k -Efl k ) 2 = Si+S 2 , (6.13) 


where 

Si = —E(X 2 i — 1 ) + Ejh, S2 = S21 + S22, 
n 

^i = ^2#(trM«-EtrM«) 2 , 
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We claim that 

nS\ —> ^4 — 1 + 2 m , (z), 11 S 2 1 —tO, nS 22 —> 0 , as n->oo. (6.14) 

Indeed, with notation = (a[^)p X p,i,j = 1 as illustrated in (4.16), we have 


n*p 


r-r J2k =1 Si=i E \ a ^| 2 —t 0. Via this, (4.34) and (4.6), a simple calculation yields 


( p 

nE 7 fc 2 ! = ^ ( E - !) 


a„-P 




= —-£ EE X-ikXjkXskXtkQ 1 - 


np* 


UUi) 




np* 




^2 b (e 4 1)a ?) + °( 1 )=^^mm^) 2 +°( 1 ) 




= — .EtrD, 2 + o(l) —>• 2m / (z). 
n 


Since E\Xf x — 1 | 2 = v 4 — 1 , we have proved the first result of (6.14). By Burkholder’s 
inequality, Lemma 4.4, (4.6), (4.18) and (4.23) 

n\S 2 i\ = K^l + z^j jE\M^Xz )\ 2 + Kn - 1 < Kn- 1 . (6.15) 


Furthermore, 


i| 'S'22 1 — 


E 


VyfEp y t=1 


f^(X t 2 fc -i)V 




- 1 )X 2 ! • BtrM^I < KJ- + o(n~ t ) 0. 


Py/rvp 

Therefore, the proof of the second result of (6.14) is completed. We then conclude from 
(6.14), (6.12), (6.13) and (6.7) that 


H 2 —> m 3 (z)(2m'(z) + v 4 — 1). 
Finally, by ( 6 . 6 ), ( 6 . 8 ), (6.9), (6.11) and (6.16), we obtain 
nEuj n = m 3 (z)(m'(z) + 124 — 2) + o(l). 


(6.16) 


Lemma 6.1 is thus proved. This finishes the proof of Proposition 3.1. 
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7. Proof of Proposition 3.2 

Recall the definition of U n below Proposition 3.2 or in Section 4. For i = l,r, by Lemma 4.4 



as n —> oo, vi —i 0. 


Moreover, 




where the convergence follows from Proposition 6.1. 

8. Calculation of the mean and covariance 

To complete the proof of Theorem 1.1 and Corollary 1.1, it remains to calculate the mean 
function and covariance function of Y(f) and X(f). The computation exactly follows Bai 
and Yao [6] and so we omit it. 
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